On-chip service processor

ABSTRACT

An integrated circuit is described that includes a stored program processor for test and debug of user-definable logic plus external interface between the test/debug circuits and the component pins. The external interface may be via an existing test interface or a separate serial or parallel port. Test and debug circuits may contain scan strings that may be used to observe states in user-definable logic or be used to provide pseudo-random bit sequences to user-definable logic. Test and debug circuits may also contain an on-chip logic analyzer for capturing sequences of logic states in user-definable circuits. Test and debug circuits may be designed to observe states in user-definable circuits during the normal system operation of said user-definable circuits.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No.10/767,265, entitled, “On-Chip Service Processor,” filed on Jan. 30,2004 (which issued as U.S. Pat. No. 6,964,001), commonly-assigned, andincorporated by reference herein in its entirety. That application is acontinuation of U.S. patent application Ser. No. 09/275,726, entitled,“On-Chip Service Processor,” filed on Mar. 24, 1999 (which issued asU.S. Pat. No. 6,687,865), commonly-assigned, and also incorporated byreference herein in its entirety. That application, in turn, is entitledto the priority of U.S. Provisional Patent Application No. 60/079,316,filed on Mar. 25, 1998.

BACKGROUND OF THE INVENTION

The present invention is related to the testing and debugging ofelectronic systems, and, in particular, to on-chip circuits for the testand diagnosis of problems in an integrated circuit.

Heretofore, logic analyzer probes have often been used in the testingand debugging of electronic systems. The logic analyzer probes werecoupled to the external pins of components of a digital system in orderto capture the sequence of signals after a predefined event (or timestamp) occurs. The captured signals can then be examined to verifycorrect system behavior or, alternatively, to identify the time and thenature of erroneous behavior in the system.

Furthermore, in the designs of large electronic systems, separateconsoles, or service processors, have often been incorporated into thecircuit boards of the system. These separate processors have a number ofuseful functions, including the control of scan strings in the system;the origination of diagnostic signal probes to run on the system, and soforth. The service processors also have diagnostic and scan debugfeatures, including access to the internal registers and memory withinthe system. The service processors have also been used to bring-up themain system during its power up phase. All of these functions have beenuseful to system designers for the design, test and debugging ofelectronic systems.

On the other hand, more and more digital systems, or parts of digitalsystems, are being integrated in a single component. The resultingcomplexity and lack of observability of an integrated circuit posesserious problems for the test, debug and bring-up stages of theintegrated circuit (IC). For example, observation at the IC componentpins of the behavior of an IC system is increasingly difficult. The ICcomponent pins may be very far (in terms of logic hierarchy) from theactual points of interest. The extremely high frequency of digital ICoperations and the frequency filtering effects of the large capacitanceof the external logic analyzer probes, often prevents a logic analyzerfrom capturing signals reliably and precisely. There is always anuncertainty regarding the accuracy of signals captured by an externallogic analyzer compared to the actual signals values within the IC.

To address the problems of the testing of integrated circuits, specialfeatures are being included in many IC designs. For example, onestandard technique is “scan” whereby, certain internal flip-flops, whichare connected to various selected points of the IC, are also connectedto form a serial shift register when the IC is configured in a testmode. Straightforward serial shift (i.e., scan) operations are utilizedto load the flip-flops with desired values, or to read out their presentvalues reflective of the logic states of the selective IC points. SuchICs require special features to reset the flip-flops (i.e., bring the ICto a known starting state). However, the size of integrated circuits hasgrown to the point where it has become inefficient and expensive to testand debug ICs using solely conventional scan techniques.

Furthermore, variations of the serial scan technique include the use ofso-called “shadow registers.” IC internal signal states are captured ina duplicate copy, i.e., the shadow register, of certain internalregisters. The shadow registers are interconnected by a dedicatedinternal scan chain. A predetermined event can trigger a snapshot of theinternal state values in the shadow registers and the dedicated scanchain shifts the captured signal state without affecting the systemoperation of the IC. However, this approach has several deficiencies.First, only a single snapshot can be captured and shifted out with eachtrigger event. This greatly hampers debugging the IC since there is notmuch visibility of the system activity around a point of interestidentified by the trigger event. Secondly, the snapshots can be takenonly of those signals in registers which have a shadow registercounterpart. Since a shadow register effectively doubles the circuitryfor the register, this approach is very costly to implement on a largescale in the IC.

Another test and debug design for ICs is found in a standard, the IEEE1149.1 Test Access Port and Boundary-Scan Architecture, which prescribesa test controller which responds to a set of predetermined instructionsand an; instruction register which holds the present instruction whichthe controller executes. Each instruction is first loaded into theinstruction register from a source outside the IC and then thatinstruction is executed by the controller. While having some advantagesof versatility and speed, the standard still binds test and debugprocedures to the world external to the IC and thus, limits itsperformance.

The present invention recognizes that while the advances in ICtechnology have helped to create the problems of testing and debuggingan IC, the advances also point the way toward solving these problems. Inaccordance with the present invention, special on-chip circuits are usedto observe the internal workings of an IC. These circuits operate atinternal IC clock rates so that the limitations of the frequency ofsignals at the IC input and output (I/O) boundary are avoided. Many morepoints in the IC system are accessed than is feasible with conventionalexternal test and debug processors. Thus the present invention offersadvantages which exceed the straight-forward savings in chip space dueto miniaturization. Additionally, the present invention reduces theamount of test logic which might have been required elsewhere on thechip.

The present invention also permits the coupling of probes to internal ICpoints. The points may be selected from a larger number of internalpoints that may be observed with an external logic analyzer. Besides thegreater observability of the internal operations of the IC, the presentinvention also improves the accuracy of the observations, as compared toan external logic analyzer.

SUMMARY OF THE INVENTION

To achieve these ends, the present invention provide's for an integratedcircuit logic blocks, a control unit, a memory associated with thecontrol unit and a plurality of scan lines. The memory holdsinstructions for the control unit to perform test and debug operationsof the logic blocks. The scan lines are responsive to the control unitfor loading test signals for the logic blocks and retrieving test signalresults from the logic blocks. The test signals and the test signalresults are stored in the memory so that the loading and retrievingoperations are performed at one or more clock signal rates internal tothe integrated circuit. The integrated circuit also has a plurality ofprobe lines which are responsive to the control unit for carrying systemoperation signals at predetermined probe points of the logic blocks. Thesystem operation signals are also stored in the memory so that thesystem operation signals are retrieved at one or more clock signal ratesinternal to the integrated circuit.

The present invention also provides for an integrated circuit which hasan interface for coupling to an external diagnostic processor, a unitresponsive to instructions from the external diagnostics processor, aplurality of probe lines coupled to the unit, and a memory coupled tothe unit and to the interface. In response to the unit, the probe linescarry sequential of sets of system operation signals at predeterminedprobe points of the integrated circuit and the system operation signalsare stored in the memory at one or more clock signal rates internal tothe integrated circuit. The system operation signals are retrieved fromthe memory through the interface to the external diagnostic processor atone or more clock signal rates external to the integrated circuit. Thisallows the external diagnostics processor to process the captured systemoperation signals.

The present invention further provides for a method of operating anintegrated circuit which has logic blocks, a control unit, a memory anda plurality of scan lines of the logic blocks. The memory is loaded withtest signals and instructions for the control unit and the scan linesresponsive to the control unit are loaded with the test signals for thelogic blocks at one or more clock signal rates internal to theintegrated circuit. The logic blocks are then operated at one or moreclock signal rates internal to the integrated circuit and the resultingtest signal results are retrieved from the logic blocks along the scanlines at one or more clock signal rates internal to the integratedcircuit. The test signal results are stored in the memory at one or moreclock signal rates internal to the integrated circuit; and the storedtest results signals are processed in the control unit responsive to thestored instructions in the memory to perform test and debug operationsof the logic blocks.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 a shows a high-level diagram of an exemplary large and complexintegrated circuit; FIG. 1 b shows the FIG. 1 a integrated circuit witha Service Processor Unit (SPU), according to one embodiment of thepresent invention;

FIG. 2 illustrates one embodiment for the architecture for the SPU ofFIG. 1 b;

FIG. 3 a illustrates the coupling between test wrappers, scan strings,probe strings and range probes to a test bus; FIG. 3 b is a circuitdiagram of a test bus connector of FIG. 3 a; FIG. 3 c is an exemplaryconnection of multiple test bus connectors;

FIG. 4 a is a circuit diagram of a block input/output connector for testwrappers for observing test points outside a block along a boundary-scanchain (for example, IEEE 1149.1 standard Test Access Port and BoundaryScan Architecture); FIG. 4 b is a circuit diagram of a block scanconnector for scan strings for observing test points inside a blockalong a scan chain;

FIG. 5 is a circuit diagram of a scan flip-flop in the FIG. 4 b circuitdiagram;

FIG. 6 a is a circuit which generates an out-of-range detection probesignals for range probes; FIGS. 6 b and 6 c are the transistor-levelcircuits of inverters in FIG. 6 a;

FIG. 7 is a circuit which generates ground-bounce detection probesignals for range probes;

FIG. 8 is a block diagram of a Built In Self-Test (BIST) engine of theFIG. 2 SPU;

FIG. 9 a is a block diagram of an input aligner portion of AnalysisEngine of the FIG. 2 SPU; FIG. 9 b is a detail of the FIG. 9 a AnalysisEngine's input aligner; FIG. 9 c is a block diagram of the AnalysisEngine's memory addressing structure; FIG. 9 d is a block diagram of thetrigger logic portion of the Analysis Engine; and

FIG. 10 is a block diagram of another embodiment of the AnalysisEngine's memory addressing structure;

FIG. 11 shows a probe string connection of probe points to the buffermemory using logic analyzer channels that are implemented with probestorage elements (PSE);

FIG. 12 shows an alternative probe string connection with improvedmultiplexed PSEs which combine probe selection and data capturefunctions; and

FIG. 13 is a block diagram of the improved PSE of FIG. 12.

DESCRIPTION OF THE SPECIFIC EMBODIMENTS

General Organization of the Present Invention

In accordance with the present invention, a Service Processor Unit (SPU)is incorporated within an integrated circuit. Besides addressing theproblems of testing and debugging the IC, the availability of aprogrammable unit, such as the SPU, which may load or unload the statevariables into and from the user-definable logic in an IC, greatlysimplifies the problem of resetting the IC and observing its currentstate. The SPU is implemented in the form of a basic stored-programcontrol unit, such as a microprocessor, with a predefined instructionset, a number of extended function units (EFUs), program, data, andscratch pad memories, plus an input/output circuit for loading andunloading the SPU memories with data/programs from the outside world.This allows the SPU to be programmed to execute a control program whichinteracts with the various extended functional units to control varioustest and debug related activities on the IC.

Each EFU is designed to control a specific test or debug feature and theEFU provides the control unit a general, programmable access to thatfeature. For example, one EFU may be designed to control the executionof serial shift operations along some or all of the internal scan chainsof the IC. The other EFUs may be enabled to interact with the scanchains, such as a predetermined algorithm to provide a Built-InSelf-Test (BIST) for an embedded Random Access Memory (RAM) block. Theexisting scan chains load and unload the BIST patterns and resultsto/from the RAM block. The EFUs provide the control unit with a straightforward, programmable means for controlling the functions of the EFUsuch that knowledge of low level details of the scan or BIST functionsbecome unnecessary.

With its program and data memories, the SPU acts autonomously once itsprogram memory has been loaded with the desired instruction sequence.The SPU's program memory may be loaded with the desired programinstructions through the SPU's interface to the external environment.Alternatively, the instructions may be stored in an on-chip Read OnlyMemory (ROM) that has been provided to work as the SPU's program memory.

In one embodiment of the present invention, an EFU carries out certainfunctions of a logic analyzer. A logic analyzer captures and storessignal state values in a digital system following the occurrence of apre-defined event. The logic analyzer then analyzes the captured dataand displays the results for perusal. With the present invention, thecapture and storage functions are incorporated into the IC. The EFUwhich implements these functions captures and stores not a singlesnapshot but a sequence (i.e., history) of signal values using logicprobes which are selectively coupled to desired points in the IC logiccircuits. The logic analyzer EFU is configurable to select the location,number and sequential depth of signal channels from a predetermined setof choices. Thus, each logic analyzer channel may be selectively coupledto more than one predetermined capture point by programming the controlunit and hence, the EFU. A solution is provided for capturing thehistory of signal values at the internal points of the IC without havingto provide each one of these points with their shadow registercounterpart. The captured data are stored in an on-chip Random AccessMemory (RAM). Transportation of the captured data out of the IC isperformed later for analysis by an external computer which can reformatand display as required for diagnostics. The present invention has thebenefit of enhanced data accuracy with minimal cost overhead byseparating the signal capture/storage function of a logic analyzer intothe IC.

Two different types of logic probes may be used with the logic analyzerEFU. One type of logic probe, termed the digital probe, capturessequences of digital signals from internal points of the IC. Digitalsignal values flow from the internal capture point to a logic analyzerchannel through the digital probe. In its simplest form each digitalprobe has at least two input ports, a selection means and an output portthat is directly coupled to a logic analyzer channel. Digital probes mayalso be constructed from a series of internal storage elements (i.e.,flip-flops or latches) to form a pipeline to move the data from thecapture points towards the logic analyzer channels. In this case, themovement of the data along the digital probe flip-flops is synchronizedwith an on-chip clock signal. Since the clock frequency also defines themaximum capture rate, the particular clock signal is selected based onthe maximum desired capture rate. The digital probes used for the logicanalyzer EFU operate with the same electrical and timing characteristicsof the native signals of the IC. The digital probes are implemented inthe same technology, with the same functional logic circuitry, and underthe same clock timing, as the rest of the IC. Signals are thereforcaptured and propagated along the digital probes in exactly the same wayas they are operated upon by the functional circuitry of the IC. Thisassures much greater accuracy of signal states captured by the digitalprobes. In contrast, logic probes used with an external logic analyzermust use trigger events and signal values that are visible external tothe IC. The captured signal values may differ significantly from theoriginal (internal) values.

The logic analyzer EFU may use a second type of logic probe, termed ananalog probe, which captures signal events representing the detection ofsignal integrity conditions, such as ground bounce. Desired signalobservation points are coupled to analog detection circuits whichproduce digital signals when particular signal conditions are detected.The analog probe records these digital signal states in the logicanalyzer EFU.

The benefits of the logic analyzer EFU are such that for certain ICs,only the EFU portion of the SPU is implemented on the IC. In thisalternate embodiment of the present invention, the digital and analogprobes are selectively enabled by a scan-chain which allows specificcontrol signals to be loaded into these probe circuits. The scan chainalso carries other control signals to be loaded into a trigger circuitwhich starts and stops the data capture operations. Once the desireddata has been captured into an on-chip RAM, the data is transportedoutside the IC for subsequent analysis and display.

Implementations of the Present Invention

As a starting point, FIG. 1 a is a diagram of an exemplary integratedcircuit. The IC 100 is complex having a host processor connected by asystem bus to various circuit blocks, including a third party core andother blocks adapted to the application of the IC. The IC also has aperipheral bus which is connected to the system bus by a bridge. Theperipheral bus is connected to other functional blocks, such as auser-developed core and so on.

A preferred embodiment of the present invention to test and debug thecomplex IC of FIG. 1 a is shown in FIG. 1 b. Added to the IC 100 is aService Processor Unit (SPU) 101 which is coupled to the IC system bus105 and an added test bus 104. Connected to the test bus 104 are testwrappers 102 which provide test communication channels into selectedblocks 106. More details of the test bus 104 and test wrappers 102 areprovided below. The SPU 101 provides a connection for an externaldiagnostics console 103 to view and test the internal workings of the IC100.

As shown in FIG. 2, the SPU 101 has several extended function units(EFUs), including a control unit, such as a microprocessor 211, a buffermemory unit 218, an analysis engine 215, a scan control unit 222, aninterrupt handler 221, which is further connected to a range check unit220, a system bus interface 214, a test bus interface 213 and a built-inself test (BIST) engine 212, which are all interconnected by a processorbus 219. The various EPUs are coupled to the processor bus 219 in anydesired combination and order. To provide communication between theexternal world and the SPU 101, the bus 219 is also connected to aserial input/output (SIO) interface 210, a parallel input/outputinterface (PIO) 216, and a test access port (TAP) 217. For example, thecoupling between the IC 100 and the external diagnostics console 103,typically implemented using another computer, uses the TAP 217, the SIOinterface 210 or the PIO interface 216.

Analog probe lines 201 are connected to the range check unit 220 whichprocesses their values to detect out-of-range conditions which are thensignaled to the interrupt handler 221. The interrupt handler 221 alsoreceives signals from trigger event lines 204 directly or from test bus104 by way of test bus connections 203 to the interrupt handler 221. Thesignals on the trigger event lines 204 or test connections 203 are usedto capture signal state values when predetermined (i.e., triggering)events occur. The interrupt handler 221 passes the captured values tothe analysis engine 215. The test bus 104 is further coupled to testwrappers 102, which are individually wrapped around a number ofpredetermined blocks 106 on the IC 100. Each test wrapper 102 accessesthe input and output signals of a block 106. The test bus 104 is alsoconnected to scan string lines 403, which are connected to internalelements of a block 106.

As shown in FIG. 3 a, the test bus 104 forms a unidirectional loop withtest bus connectors 401 selectively transferring data between the testbus 104 and a test wrapper 102. The test bus 104 is made up of multiplebit lines, where the number of the bits is determined by therequirements of the test system. Through test bus connector 401, thetest bus 104 is selectively connected to test wrappers 102, scan stringlines 403, probe string lines 402 and trigger lines 204.

A test bus connector 401 which handles a one bit connection between thetest bus 104 and a test wrapper 102 is illustrated in FIG. 3 b. A firstmultiplexer 421 has one of its input terminals connected to one of thelines of the test bus 104. The other input terminal is connected to asignal line of the test wrapper 102. The output terminal of themultiplexer 421 is connected to an input terminal of a flip-flop 426 andto an input terminal of a second multiplexer 422, which has a secondinput terminal connected to the output terminal of the flip-flop 426.The output terminal of the flip-flop 426 is also connected to the lineof the test wrapper 102, which is also in the form of a unidirectionalloop. The multiplexer 421 selects either the data from the test bus 104or the test wrapper 102; the second multiplexer 422 selects between thedata selected by the first multiplexer 431 or the data captured in theflip-flop 426 to place back onto the test bus 104. These selections aredone under the control of SPU 101. The test bus connector 401 is also beused for coupling a trigger line 204, probe string line 402 or scanstring line 403 to a test bus 104 by connecting the desired signal linein place of the line of the test wrapper 102 port as shown in FIG. 3 b.

FIG. 3 c shows an embodiment of coupling a trigger line 204, probestring 402, test wrapper 102 and scan string line 403 to three lines ofthe test bus 104. Other possible configurations for the couplingsinclude coupling the test wrapper 102 and scan string 403 onto separatelines of the test bus 104.

A test wrapper 102 is formed by serially connecting block I/O connectorcircuits 310. One such circuit 310, which couples an input or outputsignal of a block 106 to the test wrapper 102, is illustrated in FIG. 4a. The connector circuit 310 has a scan-in terminal 304 and a scan-outterminal 306. The scan-in terminal 304 of one circuit 301 is connectedto the scan-out terminal 306 of another circuit 301 to form the serialchain of a test wrapper 102. The connector circuit 310 also has adata-in terminal 302 and a data-out terminal 307 which provide aninterstitial connection between a block 106 and the rest of the IC 100.In the normal operation of the IC, the connector circuit 310 provides asimple path between the block 106 and the rest of the IC 100. If theconnector circuit 310 is to provide an input signal to the block 106during test operations, the data out terminal 307 is connected to theblock 106 and the data in terminal is connected to the rest of the IC100. If the block I/O connector circuit 310 is to receive an outputsignal from the block 106 during test operations, the data-out terminal307 is connected to the rest of the IC 100 and the data-in terminal isconnected to the block 106. The connector circuit 310 also has aprobe-in terminal 303 and a probe-out terminal 305 which provide a pathfor probe signals from selected portions of the block 106 through theconnector circuit 310 to observe operations in the block 106.

The elements of the connector circuit 310 include a scan flip-flop 301and two multiplexers 308 and 309. The data-in terminal 302 and thescan-in terminal 304 form the inputs to the flip-flop 301. The outputfrom the flip-flip 301 include the scan out terminal 306 and one inputto the multiplexer 308 having an output which forms the data-outterminal 307. The second input to the multiplexer 308 is connected tothe data-in terminal 302, which is also connected to one input to themultiplexer 309. The probe-in terminal 303 forms a second input to themultiplexer 309 whose output forms the probe-out terminal 305. Thecontrol input of the multiplexer 309 is the output of the scan flip-flop301 (and is connected to one input of the multiplexer 308). The controlinput of the multiplexer 308 is a test control line 300 from the controlunit 311 of the SPU 101. The control signal on the line 300 selectswhether the functional signal at data-in terminal 302 or the signal heldin the scan flip-flop 301 is passed onto the data-out terminal 307. Whenthe control signal of the line 300 signal is not-asserted, i.e., normalmode, there is normal operational signal flow between the data-interminal 302 and the data-out terminal 307. On the other hand, when thecontrol signal on the line 300 is in asserted state, i.e., test mode,the current state of the scan flip-flop 301 is passed onto the data-outterminal 307; the data-in terminal 302 and the data-out terminal 307 areisolated from one another. The state stored in the scan flip-flop 301 isalso controls whether the signal at the data-in terminal 302 or theprobe-in terminal 303 is passed onto the probe-out terminal 305. In thismanner, data from another probe point which is connected to the probe-interminal 303 are selectively passed onto the probe-out terminal 305. Thesignal state in the scan flip-flop 301 value is controlled and observedusing regular scan operations of the test wrapper 102 through thescan-in and scan-out terminals 304 and 306. Of course, if observation ofan input or output signal of the block 106 by a probe string 402 is notrequired, the multiplexer 309 can be eliminated from the circuit 310.

A scan string 403 is formed by serially connecting block scan connectorcircuits 320. One such circuit 320, which couples an internal element ofa block 106 to the scan string 403, is illustrated in FIG. 4 b. Theconnector circuit 320 has a scan-in terminal 314 and a scan-out terminal316. The scan-in terminal 314 of one connector circuit is connected tothe scan-out terminal 316 of another connector circuit 320 to form aserial scan string 403. The block scan connector circuit 320 also has adata-in terminal 312 and a data-out terminal 317 which provide aninterstitial connection between internal elements of the block 106. Inthe normal operation of the IC 100, the connector circuit 320 is asimple path between the internal elements in the block 106. Theconnector circuit 320 also has a probe-in terminal 313 and a probe-outterminal 315 which provide a path for probe signals from selectedportions of the block 106 through the connector circuit 320 to observeoperations in the block 106.

The block scan connector circuit 320 has a scan flip-flop 311 and amultiplexer 319. The data-in terminal 312 and the scan-in terminal 314form the inputs to the scan flip-flop 311. The output from the flip-flip311 include the scan out terminal 316 and the data-out terminal 317. Thedata-in terminal 302 is also connected to one input to the multiplexer319. The probe-in terminal 313 forms a second input to the multiplexer319 whose output forms the probe-out terminal 315. A special circuit isused for the scan flip-flop 311 (and the flip-flop 301 of FIG. 4 a). Thecircuit, which is shown in FIG. 5 and is found in previous IC scandesigns, has separate scan-slave and data-slave sections. The separationallows a state signal which has been scanned into the scan flip-flop 311to remain unaffected by functional clock pulses that cause the flip-flop311 to capture signals on the data in terminal 312 so that they appearin the data-slave section and on the data out terminal 317. Theconnector circuit 320 acts as a simple conduit for signals within theblock 106. At the same time, the previously scanned-in signal, whichappears in the scan-slave section, selects whether signals at the datain terminal 312 or the output from another probe point which has beenconnected to the probe-in terminal 313 is to be passed onto theprobe-out terminal 315. A probe string 402 is created. Of course, if aninternal scan string 403 need not be connected to a probe string 402,the multiplexer 319 can be eliminated from the circuit 320.

A probe string 402 is formed by serially connecting the probe-interminal of a connector circuit 310 and 320 to the probe-out terminal ofanother connector circuit 310 and 320. The probe string 402 typicallyhas a set of selectively connected probe points. However, only one probepoint along each probe string 402 may be actively probed at any giventime. Thus the IC designer selects the probe points which are to beconnected along the same probe string 402 and determines the totalnumber of probe strings 402 that are to be connected to the individualbits of the test bus 104. This structure allows the IC designer greatflexibility to optimize the number of test bus 104 lines with respect tothe number of simultaneously observable probe points in the IC.

The probes described above are digital probes. Two analog probes areillustrated in FIGS. 6 a, 6 b, 6 c and 7. The range check unit 220receives inputs from the analog probes that comprise signals on athreshold check line 600 and a ground bounce line 700. The unittransmits these signals to the SPU 101. FIGS. 6 a, 6 b and 6 c show thecircuit which generate the signal for the threshold check line 600. Thecircuit is used for detecting extended intermediate voltage levels. Suchvoltage levels are most likely to occur on an on-chip bus which is incontention among multiple circuit drivers. The analog probe has twoinverters 601 and 602, which are both coupled to an Exclusive-NOR logicgate circuit. FIG. 6 b is a transistor diagram depicting the lowthreshold inverter 601, and FIG. 6 c is a transistor diagram depictingthe high threshold inverter 602. These inverters 601 and 602 exhibitswitching properties characteristic of a very low internal voltage, anda very high internal voltage device, respectively. Normally, the circuitin FIG. 6 a has a logic one (1) output level, but during transitions ofthe input signal, the outputs of inverters 601 and 602 may remain inopposite states for a period sufficient to cause the circuit to go to alogic zero (0) output level before returning to the logic one (1) outputlevel. This negative pulse can be captured by the SPU 101.

FIG. 7 shows a schematic diagram of a ground bounce detector circuitwhich generates the signals for a ground bounce line 700. In thiscircuit, a quiet (and true) ground terminal 701 is connected to anN-channel transistor 702, which gate is driven by a local groundconnection terminal 703. A periodic clock on a Reset terminal 706, whichis controlled from the range check 220, clears a pair of NAND gatesconfigured as a SR latch 704, and charges a capacitor 705 having oneterminal connected to the Set input of the SR latch. The second terminalof the capacitor 705 is connected to the quiet ground terminal 701. TheN-channel transistor 702 which is gated by the local ground dischargesthe Set line of the SR latch 704, which flips the state of the SR latch704 if the local ground falls above threshold. For example, a groundspike on the local ground may drive the local ground below threshold.The frequency and duty cycle of the Reset signal determines themagnitude and duration of a ground spike on the local ground to triggerthe probe. A variety of frequencies and duty cycles are created by therange check 220 to determine the severity of ground spikes. When theprobe is triggered, the probe produces a negative (0) value until resetby the Reset signal on the terminal 706.

Returning to the components of the SPU 101, FIG. 8 is a preferredembodiment of the BIST engine 212. A polynomial register 711 identifiesthe bits in a linear feedback shift register (LSFR) 714 which are usedto form an Exclusive-OR (XOR) function which generates pseudo-randomvalues. The polynomial register 711 is set by the microprocessor 211,which also initializes contents of the LSFR 714. The output of the LSFR717 is connected to the inputs of a multiplexer 715 which also receivesthe outputs of a mask shift register 712 and a pattern shift register713. The output of the multiplexer 715 is an input to the LSFR 714. Themask shift register 712 identifies the bit positions whose values areselected from predetermined bit patterns in mask shift register 713versus the bit positions which receive the pseudo-random valuesgenerated by the LFSR 714. The output of the multiplexer 715 is acombination of built-in-self-test and functional scan vectors. Thesefeatures are useful because random vectors work well only when thecontrols allow the random vectors to exercise most of the IC sectionunder test. If there are more than a few control lines, the probabilityof properly exercising the logic under test with random vectors is verylow. These features also allow the SPU 101 to generate regularlyrepeating patterns; for example, periodic patterns that may be useful ina memory test may be generated by the SPU 101 that may output the datato the section of logic under test via the test bus or the system bus,whichever has been provided with a connection to the SPU 101.

Another EFU of the SPU 101 is the analysis engine 215. FIG. 9 a shows anembodiment of the analysis engine 215 which, under the control of themicroprocessor 211, captures logic signals from the test bus 104. Thisis achieved by first setting either the scan flip-flops 301 of the blockI/O connector circuits 310 (FIG. 4 a) or the scan flip-flops 311 of theblock scan connector circuit 320 (FIG. 4 b) so that a boundaryconnection or an internal point connection of the target block 106 isselected for probing, respectively. Next, all flip-flops along the sameprobe string 402 are programmed (by the SPU 101) so that only signalsfrom the selected probe point are allowed to flow through the probestring 402 and arrive at the test bus connector 401. The multiplexer 421and the multiplexer 422 in the test bus connector 401 (FIG. 3 a) arecontrolled by the SPU 101 so that the signals on the probe string 402are passed along to the test bus 104. Finally, all remaining test busconnector circuits 401 along the same bit line of the test bus 104 arecontrolled by the SPU 101 so that they pass the probe signals along testbus 104. This allows the selected probe signal to arrive at the analysisengine 215 where it is captured for subsequent off-line analysis. Theinput terminals of a plurality of flip-flops 805, one for each bit lineof the test bus 104, form the input port 802 of the analysis engine 215.A digital phase locked loop (PLL) 802 has selectable clock outputs 803to each flip-flop 805 to tune when the data from each probe point is tobe captured. The output terminal of each flip-flop 905 is connected tothe input terminal of a variable First-In-First-Out shift register(FIFO) 804.

FIG. 9 b shows the circuit details of each variable First-In-First-Outshift register (FIFO) 804, each having a number of serially-connectedregister stages 812. Each register stage 812 has a multiplexer which,under control of a decoder 811, selects between the signal held in aflip-flop of that stage or the incoming signal to the stage to place onthe stage's output terminal. The shift depth of each variable FIFO 804is programmable by the SPU 101 by setting a count register 810 for eachbit feeding the analysis engine 215. The value in the count register 810is decoded by the decoder 811. The result controls the number ofregister stages 812 which are bypassed. This compensates for the pathdelay differences among the different probe points by realigning capturetimes of signals captured in the analysis engine 215.

The analysis engine 215 also has trigger logic which controls thecapture of data. FIGS. 9 c and 9 d show sections of the trigger logic, aprogrammable circuit which detects one or more events to stop theanalysis engine 215 from capturing new data. The data that has beencaptured up to that point is preserved in the buffer memory 218 of theSPU 101. The buffer memory 218 resides in the same address space as theRAM used by the SPU 101 but may be mapped to use high memory space inorder to prevent interference with the instructions and data stored inlow memory space. When the analysis-engine 215 collects data, it may beallowed to write over old data, keeping only as many most-recent cyclesof data as the buffer memory 218 can hold. The size of the buffer memory218 for the analysis engine 215 is determined by the designer of the IC.

The trigger logic has a start address counter 820 and a stop counter821, which are shown in FIG. 9 c. These counters are loaded by themicroprocessor 211. The trigger circuit also has an address counter 822which is designed to overflow at the highest memory address of thebuffer memory 218. At that point the start address is reloaded with thebeginning address of the high memory space which is reserved for thebuffer 218. This converts a random access memory into a FIFO register.The stop counter 821 decrements when a latched trigger signal line 824becomes set. Subsequently the analysis engine 215 collects data into thebuffer memory 218 from the variable FIFOs 804 for as many cycles asdefined by the value loaded into the stop counter 821. The system ICdesigner uses the buffer memory size and the value in the stop counter821 as two parameters to control the amount of data collected before andafter an event has been detected.

Also part of the trigger logic is a circuit which generates thetriggering signals on the trigger signal line 824. As shown in FIG. 9 d,the generating circuit is structured to form Boolean AND-OR logic 831out of individually selectable terms 832. The terms 832 are fed from apolarity programming logic circuit 833 that accepts individual triggervariables, Probe 1 through Probe N. In addition, the true or thecomplemented value for the output function can be selected through afinal level circuit 830. In one embodiment (shown in FIG. 9 d), theresult is also shifted into three successive flip-flops 834. Each of theflip-flops 834 drives one input of each of a plurality of multiplexers835. The other inputs of the multiplexers 835 are set to a logic one (1)level. Each multiplexer 835 is individually controlled throughprogrammable bits and the multiplexer outputs are logically ANDedtogether to form a signal, T[i], which represents the presence of thetrigger condition over four consecutive clock periods. The output fromthe AND gate 836 is passed to an AND gate 837 with inputs from thecorresponding AND gates 836 of duplicate circuits that produce T[0],T[1], through T[n] signals. The output of AND gate 837 is stored in alatch 838 to form the latched trigger signal on the line 824. Once thesignal is set, the latched trigger signal maintains its value until itis reset through reprogramming by the microprocessor 211. In otherembodiments, there may be more or fewer latches, and additional logic tomake adjustments to the phases (i.e., the relative clock cycle whensignal is received) of the individual signals.

Another embodiment of the trigger logic is shown in FIG. 10. Thisembodiment provides for the capability of reversing the data capturingfunction of the analysis engine 215 from continually capturing new datauntil the trigger is detected, to not capturing any data until a triggeris received. In the latter case, each time a trigger signal on the line824 is received, the analysis engine 215 captures new data for apreprogrammed number of cycles and then stops until the next latchedsignal on the line 824 is received. To enable this mode of operation,the trigger circuit shown in FIG. 10 causes the previous triggercondition to be cleared so that it may be recognized again. This mode isvery useful since it enables the capture of signals around (i.e., beforeand after) multiple occurrences of trigger conditions. The buffer memory218 is utilized more efficiently as the storage of unwanted cycles ofdata between the trigger points is not required. It is also possible toprogram the trigger logic so it uses an externally generated triggercondition 902 in place of an internally programmed event.

Program instructions and initial data values for executing programs toimplement the functions of the SPU 101 are loaded from the diagnosticsconsole 103 (see FIG. 1 b) into the buffer memory 218 of the SPU 101.Some of these programs may access the system bus 105 or the test bus104. A program can control which test wrapper 102 is accessed by usingthe test bus interface 213 in order to set control signals on the testbus 104. This allows the SPU 101 to read data from a test wrapper 102via the test bus 104 into the buffer memory 218 and then send said dataout to the diagnostic console 103. Typically, a separate programexecuted on the diagnostic console 103 displays this information in ahuman readable format as may be appropriate for the given application.

Programs executed by the SPU 101 can also read data from the diagnosticsconsole 103 via the SIO interface 210 or TAP interface 217, as shown inFIG. 2 b, and write data out to individual scan flip-flops on the testwrappers 102 via the test bus 104. Significant processing, for example,expansion, compaction, or intermediate storage of data can be done bythe SPU 101 utilizing the buffer memory 218. In other embodiments,control functions may be supplied directly from the TAP interface 217 orSIO interface 210 to the analysis engine 215 or BIST engine 212, via theprocessor bus 219 without involving the microprocessor 211. The SPU 101may be coupled to either the system bus 205, or a separate test bus 104,or both. The coupling to the diagnostics console 103 may be via the TAPinterface 217 or the SIO interface 210. The test bus 104 may be coupledto one or more test wrappers 102.

Another embodiment of the invention is defined in which the SPU 101 doesnot include an embedded microprocessor 211. In this case, the analysisengine 215 and the BIST engine 212 can access the buffer memory 218 andsystem bus interface 214 directly, following instructions received fromthe external diagnostics console 103. In this case, the loading of theconfiguration information and transfer of data to and from the analysisengine 215 is controlled using hardwired control signals. In thisembodiment, the analysis engine 215 is implemented in the form of anon-chip logic analyzer (OLA) which captures sequential snapshots of setsof signals. The selected signals form the digital probes 202. Theselections are achieved by coupling the signals for digital probes 202to the channels of the analysis engine 215 and turning-on enablingcircuits, if provided, to allow the signals on the digital probes 202value to be captured onto channels of the logic analyzer 215. As shownin FIG. 11, the channels of the logic analyzer 215 are formed from probestorage elements (PSE) 1000 to form, a distributed serial shift registerwhich acts as a pipeline to move data captured at a probe point towardsthe end of the logic analyzer channel where the data are stored inbuffer memory 218. Each channel of the analysis engine 215 contains zeroor more number of PSEs 1000 which are clocked by a common periodic clocksignal labeled “Cf” on a clock signal line 1001. The clock signal ischosen (at design time) from among the fastest frequency of clocksignals which are used in generating source signals to be captured bythe probes. This way all signals captured on the analysis engine 215channels arrive at the end of the channels after a fixed, predeterminednumber of clock cycles so that their cycle relationship to one anotheris preserved, regardless of the length (i.e., number of bits) of theindividual channels of analysis engine 215.

Subsequently, after the captured data has been transported to theexternal diagnostics console 103, software processes use the number ofPSEs 1000 on each channel of the analysis engine 215 to align the datawith respect to one another. The lengths (i.e. number of bits) of theserial shift registers on the individual channels of the analysis engine215 are determined at design time so that signal delays due to physicaldistances among the PSEs 1000 are sufficiently short to allow data to beshifted between consecutive bits of the shift registers in a singleclock cycle. If necessary, the number of stages of the shift registersmay be increased to satisfy this condition. Each channel of the analysisengine 215 is coupled to a different data input port of the buffermemory 218. The collective data applied to the ports of the buffermemory 218 is written to an address in memory which is identified by acommon address register 822 that advances under control of the periodicclock signal “Cf” on the line 1001.

FIG. 12 shows a preferred embodiment of a channel of the OLA 215 whichuses multiplexed PSEs 1000 to combine the selection of probe points andpipelining captured data into a single, efficient design. This enablesthe coupling one PSE 1000 to two probe points or another PSE 1000. Scanoperations shift a control signal into the PSE 1000 to program itself toselect one or the other of its input ports.

The details of a multiplexed PSE are shown in FIG. 13. The PSE 1000,illustrated by a dotted line, is connected to a multiplexer 1108 whichhas two input terminals connected to two input probe paths, P1 and P2,for the logic analyzer channels. Besides the probe clock signal line1001, which carries the Cf signal, the PSE 1000 is connected to a firstscan clock signal line 1101, which carries an A_clk signal, a secondscan clock signal line 1102, which carries a B_clk signal, and a scancontrol line 1103, which carries a Scan_mode signal. The PSE 1000 hasthree latches 1105, 1106 and 1107. The output terminal of the latch 1105is connected to one input terminal of the latch 1106 and to one inputterminal of the latch 1107. One input terminal of the latch 1105 isconnected to the output terminal of the multiplexer 1108 and a secondinput terminal of the latch 1105 forms a scan data input terminal 1104,SI. The output terminal of the latch 1107 forms a scan data outputterminal, SO, and is also connected to the control terminal of themultiplexer 1108. The output terminal of the latch 1106 forms an outputprobe path, Q, for the logic analyzer channels.

The scan clock signals, A_clk and B_clk respectively, and the Scan_modesignal configure the PSE 1000. For serial shift operations, the serialinput (SI) on the line 1104 is captured into the latch 1105 when theA_clk signal is applied and the output of the latch 1105 is capturedinto the latch 1106 when the B_clk signal is applied. If the Scan_modesignal on the line 1103 is set to a logic 1, the B_clk signal on theline 1102 is also passed through a multiplexer 1109 and an AND gate 1112to the latch 1107 by a clock signal line 1111. Thus, non-overlappingA_clk and B_clk signals on the clock signal lines 1101 and 1102respectively clock serial shift operations in the PSE 1000. Signalsscanned into the latch 1105 through line 1104 are also scanned into thelatch 1107 (and the latch 1106) and the SO output terminal. Thiscompletes the programming of the PSE 1000 such that value that has beenloaded into the latch 1107 controls input multiplexer 1108 which selectsbetween two input ports 1109 and 1110. Once the PSE 1000 has beenprogrammed, the Scan_mode signal on control line 1103 signal is set toand maintained at logic 0 until the PSE 1000 is programmed with a newvalue. When the Scan_mode signal is set to logic 0, the PSE 1000performs its normal data capture function using the clock signal Cf onthe line 1001. The Cf clock signals are passed by the multiplexer 1109to the latch 1106 by a clock signal 1110. The latch 1106 captures thesignals from the latch 1105 and the multiplexer 1108 at the Cf clockrate and passes the signals out to the Q output terminal. Themultiplexed-PSEs shown in FIGS. 12 and 13 build cost efficient logicanalyzer channels.

Once enabled, the analysis engine 215 captures new values first into theflip-flops along the OLA channels and subsequently into the buffermemory 218 using trigger signals that have been pre-programmed andimplemented as shown in FIGS. 9 c, 9 d and 10.

In one mode of operation of the IC 100 shown in FIG. 1 b, the humanengineer may use the diagnostics console 103 to initialize both of thesystem logic and the SPU 101. In this manner, the SPU 101 may beprogrammed to perform logic analyzer functions and specific probe pointsmay be enabled so that a history of data values appearing at theselected probe points can be captured by SPU 101. Additionally, thetrigger logic shown in FIGS. 9 and 10 may be programmed to select adesired trigger event in order to stop the data capture operations.Next, the diagnostics console 103 invoke the IC 100 to execute itsnormal system operations. If and when the selected trigger event isdetected and the analysis engine 215 has captured the required data, thediagnostics console 103 instructs the SPU 101 to transfer the captureddata values out of the IC 100 and into the diagnostics console 103 wherethe data may be formatted and presented for analysis and interpretation.The diagnostics console 103 and the SPU 101 can constrain some of thesignals on one or more test wrappers 102 in order to affect the behaviorof the IC 100 and perform logic analysis under these conditions. Forexample, this approach may be useful to determine how the overallbehavior of the IC 100 is affected when some of the functionality of anyone of the blocks 106 is disabled.

In a different mode of operation automatic test equipment (ATE) mayaccess the IC 100 through its TAP interface 217 in order to initializethe SPU 101 so that internal scan strings 403 and test wrappers 102 areloaded with predetermined test values. The response of the blocks 106 isobserved using the scan strings 403 and test wrappers 102. Furthermore,the ATE may be programmed to instruct the SPU 101 to execute BIST orother buffer memory 218 test functions and to check the results todetermine pass or fail conditions.

In yet another mode of operation, it is possible to use an in-circuittest (ICT) or similar board-level test equipment to access the IC 100through its TAP interface 217 in order to instruct the SPU 101 toturn-on its external memory test function. In this mode, patterns aregenerated by the SPU 101 and made to appear at specific I/O pins of theIC 100 which are coupled to external memory. For example, the IC 100 maygenerate the data and address values that are applied to the externalmemory. The data responses received are captured in order to determineif the external memory is functioning correctly.

While the description above provides a full and complete disclosure ofthe preferred embodiments of the present invention, variousmodifications, alternate constructions, and equivalents will be obviousto those with skill in the art. Thus, the scope of the present inventionis limited solely by the metes and bounds of the appended claims.

1. An integrated circuit comprising: one or more logic blocks generatingone or more system operation signals at one or more system operationclock rates; and a service processor unit to perform test and debugoperations of said logic blocks of said integrated circuit, furthercomprising: a control unit; a buffer memory; an analysis engine; a scancontrol unit, a test bus interface; and a built-in self test (BIST)engine; wherein, in a first mode of operation, said service processorunit performs capture and analysis of system operation signals duringnormal system operation through said test bus interface.
 2. Anintegrated circuit as in claim 1, wherein, in a second mode ofoperation, said service processor unit performs tests on one or more ofsaid one or more logic blocks through said test bus interface.
 3. Anintegrated circuit as in claim 1, the service processor unit furthercomprising at least one of the ports selected from the group consistingof: a parallel I/O (PIO) port, a serial I/O (SIO) port, and a JTAG port;wherein data and instructions are sent through said at least one of saidports to said service processor unit from an external diagnosticsconsole, and result data is sent through said at least one of said portsfrom said service processor unit to said external diagnostics console.4. An integrated circuit comprising: one or more logic blocks generatingone or more system operation signals at one or more system operationclock rates; and, a service processor unit to perform test and debugoperations of at least one of said logic blocks of said integratedcircuit, further comprising: a control unit; a buffer memory; ananalysis engine; and a system bus interface; wherein, in a first mode ofoperation, said service processor unit performs capture and analysis ofsystem operation signals during normal system operation through saidsystem bus interface.
 5. An integrated circuit as in claim 4, wherein,in a second mode of operation, said system processor unit performs testson one or more of said one or more logic blocks through said system businterface.
 6. An integrated circuit as in claim 4, the service processorunit further comprising at least one of the ports selected from thegroup consisting of: a parallel I/O (PIO) port, a serial I/O (SIO) port,and a JTAG port; wherein data and instructions are sent through said atleast one of said ports to said service processor unit from an externaldiagnostics console, and result data is sent through said at least oneof said ports from said service processor unit to said externaldiagnostics console.